34 research outputs found
The Zero Resource Speech Challenge 2017
We describe a new challenge aimed at discovering subword and word units from
raw speech. This challenge is the followup to the Zero Resource Speech
Challenge 2015. It aims at constructing systems that generalize across
languages and adapt to new speakers. The design features and evaluation metrics
of the challenge are presented and the results of seventeen models are
discussed.Comment: IEEE ASRU (Automatic Speech Recognition and Understanding) 2017.
Okinawa, Japa
Early phonetic learning without phonetic categories: Insights from large-scale simulations on realistic input
International audienceBefore they even speak, infants become attuned to the sounds of the language(s) they hear, processing native phonetic contrasts more easily than non-native ones. For example, between 6-8 months and 10-12 months, infants learning American English get better at distinguishing English [ɹ] and [l], as in ‘rock’ vs ‘lock’, relative to infants learning Japanese. Influential accounts of this early phonetic learning phenomenon initially proposed that infants group sounds into native vowel- and consonant-like phonetic categories—like [ɹ] and [l] in English—through a statistical clustering mechanism dubbed ‘distributional learning’. The feasibility of this mechanism for learning phonetic categories has been challenged, however. Here we demonstrate that a distributional learning algorithm operating on naturalistic speech can predict early phonetic learning as observed in Japanese and American English infants, suggesting that infants might learn through distributional learning after all. We further show, however, that contrary to the original distributional learning proposal, our model learns units too brief and too fine-grained acoustically to correspond to phonetic categories. This challenges the influential idea that what infants learn are phonetic categories. More broadly, our work introduces a novel mechanism-driven approach to the study of early phonetic learning, together with a quantitative modeling framework that can handle realistic input. This allows, for the first time, accounts of early phonetic learning to be linked to concrete, systematic predictions regarding infants’ attunement
Clinical and virological features of acute HBV-related hepatitis in southern Vietnam
Background: Despite the availability of effective vaccines, hepatitis B virus (HBV) infection is still frequent worldwide, and accounts for significant morbidity and mortality. However, data of acute HBeAg negative hepatitis still remain limited. Aims and Methods: To understand clinical pictures of acute HBV hepatitis and its natural evolution, a prospective study was conducted in adult patients. Results: Ninety patients were enrolled between March 2004 and April 2005 at Hospital for Tropical Diseases in Ho Chi Minh city. The prevalence of HBeAg negative was 53%. No significant difference was found in clinical characteristics and laboratory findings between HBeAg positive and negative patients. HBV-DNA was detected in 75% and 88% HBe negative and positive patients, respectively, where the frequency of ALT below 400 U/L was significantly higher in HBeAg negative cases (p = 0.01). Six month follow-up was available in 47 patients. HBsAg positivity was found in 16% of HBeAg negative subjects but only in 4.5% of HBeAg positive cases. Thirty two patients had neither HBsAg nor anti-HBs. Conclusions: The clinical and laboratory feature and the ou
Vocal markers from sustained phonation in Huntington's Disease
Disease-modifying treatments are currently assessed in neurodegenerative
diseases. Huntington's Disease represents a unique opportunity to design
automatic sub-clinical markers, even in premanifest gene carriers. We
investigated phonatory impairments as potential clinical markers and propose
them for both diagnosis and gene carriers follow-up. We used two sets of
features: Phonatory features and Modulation Power Spectrum Features. We found
that phonation is not sufficient for the identification of sub-clinical
disorders of premanifest gene carriers. According to our regression results,
Phonatory features are suitable for the predictions of clinical performance in
Huntington's Disease.Comment: To appear at INTERSPEECH 2020. 1 pages of supplementary material
appear only in the arxiv version. Code to replicate
https://github.com/bootphon/sustained-phonation-feature
Baby Cloud, a technological platform for parents and researchers
International audienceIn this paper, we present BabyCloud, a platform for capturing, storing and analyzing day-long audio recordings and photographs of children's linguistic environments, for the purpose of studying infant's cognitive and linguistic development and interactions with the environment. The proposed platform connects two communities of users: families and academics, with strong innovation potential for each type of users. For families, the platform offers a novel functionality: the ability for parents to follow the development of their child on a daily basis through language and cognitive metrics (growth curves in number of words, verbal complexity, social skills, etc). For academic research, the platform provides a novel means for studying language and cognitive development at an unprecedented scale and level of detail. They will submit algorithms to the secure server which will only output anonymized aggregate statistics. Ultimately, BabyCloudaims at creating an ecosystem of third parties (public and private research labs...) gravitating around developmental data,entirely controlled by the party whose data originate from, i.e. familie
The Zero Resource Speech Challenge 2019: TTS without T
International audienceWe present the Zero Resource Speech Challenge 2019, which proposes to build a speech synthesizer without any text or pho-netic labels: hence, TTS without T (text-to-speech without text). We provide raw audio for a target voice in an unknown language (the Voice dataset), but no alignment, text or labels. Participants must discover subword units in an unsupervised way (using the Unit Discovery dataset) and align them to the voice recordings in a way that works best for the purpose of synthesizing novel utterances from novel speakers, similar to the target speaker's voice. We describe the metrics used for evaluation , a baseline system consisting of unsupervised subword unit discovery plus a standard TTS system, and a topline TTS using gold phoneme transcriptions. We present an overview of the 19 submitted systems from 10 teams and discuss the main results
The Zero Resource Speech Challenge 2020: Discovering discrete subword and word units
International audienceWe present the Zero Resource Speech Challenge 2020, which aims at learning speech representations from raw audio signals without any labels. It combines the data sets and metrics from two previous benchmarks (2017 and 2019) and features two tasks which tap into two levels of speech representation. The first task is to discover low bit-rate subword representations that optimize the quality of speech synthesis; the second one is to discover word-like units from unsegmented raw speech. We present the results of the twenty submitted models and discuss the implications of the main findings for unsupervised speech learning
Vocal markers from sustained phonation in Huntington's Disease
To appear at INTERSPEECH 2020. 1 pages of supplementary material appear only in the arxiv version. Code to replicate https://github.com/bootphon/sustained-phonation-featuresInternational audienceDisease-modifying treatments are currently assessed in neurodegenerative diseases. Huntington's Disease represents a unique opportunity to design automatic sub-clinical markers, even in premanifest gene carriers. We investigated phonatory impairments as potential clinical markers and propose them for both diagnosis and gene carriers follow-up. We used two sets of features: Phonatory features and Modulation Power Spectrum Features. We found that phonation is not sufficient for the identification of sub-clinical disorders of premanifest gene carriers. According to our regression results, Phonatory features are suitable for the predictions of clinical performance in Huntington's Disease
Memoria, remoción, olvido del estalinismo en la Rusia postsoviética
El estalinismo - aquí entendido como el sistema político y de gobierno instaurado por Stalin en la Unión Soviética y, cronológicamente, como el período de la historia soviética durante el cual Stalin ejerció un poder casi absoluto- ha representado una experiencia profundamente traumática para la sociedad rusa y soviética en general, así como para los países de Europa Central y Oriental incluidos después de 1945 en la esfera de influencia soviética. Para Rusia el estalinismo significó, por un lado, una transformación radical y violenta de la sociedad, por el otro, un terrorismo de Estado y un conjunto de represiones políticas que provocaron millones de víctimas. El recuerdo de este pasado permanece como un problema no resuelto de la memoria rusa. La sociedad rusa postsoviética continúa, de hecho, profundamente dividida a propósito de este “pasado que no pasa”, con el que no ha podido realmente lidiar: la división tiene que ver más con la interpretación que con los hechos, y qué sentido se le atribuye a ese pasado. Prevalece una actitud ambivalente, tanto en el gobierno cómo en la mayoría de la población. En estas páginas analizamos el recorrido de la memoria del estalinismo en Rusia desde la Perestroika hasta hoy, y los fenómenos de remoción, olvido y silencio que lo acompañaron.“Memoria, rimozione, oblio dello stalinismo nella Russia postsovietica” fue previamente publicado
en italiano en el Giornale di Storia Contemporanea , año XIX, número 1, 2016, pp. 7-32.Facultad de Humanidades y Ciencias de la Educació
Risk Factors of Streptococcus suis Infection in Vietnam. A Case-Control Study
Background: Streptococcus suis infection, an emerging zoonosis, is an increasing public health problem across South East Asia and the most common cause of acute bacterial meningitis in adults in Vietnam. Little is known of the risk factors underlying the disease. Methods and Findings: A case-control study with appropriate hospital and matched community controls for each patient was conducted between May 2006 and June 2009. Potential risk factors were assessed using a standardized questionnaire and investigation of throat and rectal S. suis carriage in cases, controls and their pigs, using real-time PCR and culture of swab samples. We recruited 101 cases of S. suis meningitis, 303 hospital controls and 300 community controls. By multivariate analysis, risk factors identified for S. suis infection as compared to either control group included eating "high risk" dishes, including such dishes as undercooked pig blood and pig intestine (OR1 = 2.22; 95% CI = [1.15-4.28] and OR2 = 4.44; 95% CI = [2.15-9.15]), occupations related to pigs (OR1 = 3.84; 95% CI = [1.32-11.11] and OR2 = 5.52; 95% CI = [1.49-20.39]), and exposures to pigs or pork in the presence of skin injuries (OR1 = 7.48; 95% CI = [1.97-28.44] and OR2 = 15.96; 95% CI = [2.97-85.72]). S. suis specific DNA was detected in rectal and throat swabs of 6 patients and was cultured from 2 rectal samples, but was not detected in such samples of 1522 healthy individuals or patients without S. suis infection. Conclusions: This case control study, the largest prospective epidemiological assessment of this disease, has identified the most important risk factors associated with S. suis bacterial meningitis to be eating 'high risk' dishes popular in parts of Asia, occupational exposure to pigs and pig products, and preparation of pork in the presence of skin lesions. These risk factors can be addressed in public health campaigns aimed at preventing S. suis infectio